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>LGR4 nucleotide sequence (SEQ ID NO: 01) 

ATGCCGGGCCCGCTAGGGCTGCTCTGCTTCCTCGCCCTGGGGCTGCTCGGCTCGGCCGGGCCCAGCGGCGCGGCGCCGCCT 
CTCTGCGCGGCGCCCTGCAGCTGCGACGGCGACCGTCGGGTGGACTGCTCCGGAAAGGGGTTGACGGCCGTACCGGAGGGT 
CTCAGCGCCTTCACCCAAGCACTGGATATCAGTATGAACAATATCACCCAGTTACCAGAAGATGCATTTAAGAGTTTCCCA 
TTTCTAGAGGAGCTACAACTGGCTGGTAACGACCTTTCTCTTATCCATCCAAAAGCCTTGTCTGGGCTGAAAGAACTCAAA 
GTCCTAACACTCCAGAATAATCAGTTGAGAACAGTGCCCAGTGAAGCCATTCACGGACTGAGTGCTTTGCAGTCTTTACGC 
TTAGATGCCAACCATATTACCTCAGTCCCGGAGGACAGTTTTGAAGGGCTTGTCCAGTTACGCCATCTGTGGCTGGATGAC 
AACAGCTTGACGGAAGTGCCCGTGCGTCCCCTCAGCAACCTGCCAACCCTGCAGGCGCTGACCTTGGCTCTCAACAACATC 
TCAAGCATCCCTGACTTCGCTTTCACCAACCTTTCAAGCTTGGTGGTTCTGCATCTGCATAACAATAAAATTAAAAGCCTC 
AGTCAACACTGTTTTGATGGACTAGATAACCTGGAAACCTTGGACTTGAATTACAATTACTTGGATGAGTTTCCTCAGGCT 
ATTAAAGCCCTTCCCAGCCTTAAAGAGCTGGGATTTCACAGTAATTCTATTTCTGTTATTCCTGATGGAGCATTTGGTGGT 
AATCCACTGCTAAGAACTATTCATTTGTATGATAATCCTCTGTCTTTTGTGGGGAACTCAGCATTTCACAACCTGTCTGAT 
CTGCATTGCTTAGTCATTCGTGGTGCAAGCCTGGTGCAGTGGTTCCCCAATCTGACCGGAACTGTCCATTTGGAGAGTCTA 
ACCTTGACAGGGACAAAAATAAGCAGCATACCTGATGATCTGTGCCAAAACCAAAAGATGCTGAGGACTCTGGACTTATCT 
TATAACAATATAAGAGACCTTCCAAGTTTTAATGGTTGTCGTGCATTGGAAGAAATTTCATTGCAGCGTAATCAAATCTCC 
CTAATAAAGGAAAATACTTTTCAAGGCCTAACATCTCTAAGGATTCTAGATCTGAGTAGAAACCTGATCCGTGAAATTCAC 
AGTGGAGCTTTTGCGAAGCTTGGGACAATTACTAACCTGGATGTAAGTTTCAATGAATTAACTTCATTTCCTACGGAAGGC 
CTAAATGGGCTCAATCAACTAAAGCTTGTGGGTAACTTCAAGCTGAAAGACGCCTTGGCAGCCAGAGACTTTGCTAATCTC 
AGGTCTCTATCAGTACCATATGCTTATCAGTGTTGTGCATTTTGGGGGTGTGACTCTTTATGCAAATTAAACACAGAAGAT 
AACAGCCCCCAAGAACACAGTGTGACAAAAGAGAAAGGTGCTACAGATGCAGCAAATGTCACCAGCACTGCTGAGAACGAA 
GAACATAGCCAAATAATTATCCACTGTACACCTTCAACAGGTGCTTTCAAGCCCTGTGAATATTTACTGGGAAGCTGGATG 
ATTCGCCTTACAGTGTGGTTCATTTTCCTGGTCGCCTTGCTTTTCAACCTGCTTGTCATTTTAACAGTGTTTGCGTCTTGT 
TCATCACTGCCTGCCTCCAAACTCTTCATAGGCTTGATTTCTGTGTCTAACTTACTCATGGGCATCTATACTGGCATCCTT 
ACTTTTCTTGATGCTGTGTCCTGGGGCCGATTTGCCGAATTTGGCATTTGGTGGGAAACTGGCAGCGGCTGCAAGGTAGCC 
GGGTCTCTGGCAGTCTTCTCCTCAGAGAGCGCTGTATTCCTATTAACACTGGCAGCTGTGGAAAGAAGCGTATTTGCAAAG 
GATTTGATGAAACACGGGAAGAGCAGTCACCTCAGACAGTTCCAGGTGGCCGCCCTCTTAGCTTTGCTGGGTGCCGCAGTG 
GCAGGCTGCTTCCCCCTTTTCCACGGAGGGCAATATTCTGCATCGCCCTTGTGTTTGCCGTTTCCTACAGGAGAAACCCCA 
TCGTTAGGATTCACTGTGACCTTAGTGCTATTAAACTCACTGGCATTTTTACTAATGGCCATTATCTACACTAAACTATAC 
TGCAACTTAGAGAAGGAGGACCTGTCGGAAAACTCCCAGTCTAGCGTGATTAAGCACGTTGCCTGGCTCATCTTCACAAAC 
TGCATCTTCTTCTGCCCTGTTGCATTTTTCTCAT.TTGCACCATTGATCACGGCAATCTCCATCAGCCCCGAGATAATGAAG 
TCTGTTACACTGATATTCTTCCCGTTGCCTGCTTGCCTGAATCCGGTCCTGTATGTTTTCTTCAACCCAAAGTTTAAAGAA 
GACTGGAAGCTACTGAAGCGGCGTGTTACCAGGAAACACGGATCTGTTTCAGTTTCCATCAGCAGCCAAGGCGGTTGTGGG 
GAACAGGATTTCTACTATGACTGTGGCATGTATTCCCACTTGCAGGGTAACCTGACTGTCTGTGACTGCTGTGAGTCATTT 
CTTTTGACAAAACCAGTATCATGCAAACACTTAATAAAATCGCACAGTTGTCCTGTATTGACAGCGGCCTCTTGCCAGAGG 
CCAGAGGCCTACTGGTCTGATTGTGGTACACAGTCAGCCCATTCTGACTATGCAGATGAAGAAGATTCCTTTGTCTCAGAC 
AGCTCTGACCAGGTGCAGGCCTGTGGACGAGCCTGCTTCTACCAGAGTCGTGGATTCCCTCTGGTGCGCTATGCTTACAAT 
CTACAGAGAGTCAGAGACTGA 



>LGR4 amino acid sequence (SEQ ID NO: 02) 

MPGPLGLLCFLALGLLGSAGPSGAAPPLCAAPCSCDGDRRVDCSGKGLTAVPEGLSAFTQALDISMNNITQLPEDAFKSFP 
FLEELQLAGNDLSLIHPKALSGLKELEO/LTLQNNQLRTVPSEAIHGLSALQSLRLDANHITSVPEDSFEGLVQLRHLWLDD 
NSLTEVPVRPLSNLPTLQALTLALNNISSIPDFAFTNLSSLVVLHLHNNKIKSLSQHCFDGLDNLETLDLNYNYLDEFPQA 
IKALPSLKELGFHSNSISVIPDGAFGGNPLLRTIHLYDNPLSFVGNSAFHNLSDLHCLVIRGASLVQWFPNLTGTVHLESL 
TLTGTKISSIPDDLCQNQECMLRTLDLS YNNIRDLPSFNGCRALEEISLQRNQISLIKENTFQGLTSLRILDLSRNLIREIH 
SGAFAKLGTITNLDVSFNELTSFPTEGLNGLNQLKLVGNFKLKDALAARDFANLRSLSVPYAYQCCAFWGCDSLCKLNTED 
NSPQEHSVTKEKGAT DAANVTSTAENEEHSQI I IHCTPSTGAFKPCEYLLGSWMIRLTVWFI FLVALLFNLLVILTVFASC 
SSLPASKLFIGLISVSNLLMGI YTGILTFLDAVSWGRFAEFGIWWETGSGCKVAGSLAVFSSESAVFLLTLAAVERSVFAK 
DLMKHGKSSHLRQFQVAALLALLGAAVAGCFPLFHGGQYSASPLCLPFPTGETPSLGFTVTLVLLNSLAFLLMAI IYTKLY 
CNLEKEDLSENSQSSVIKHVAWLIFTNCI FFCPVAFFSFAPLITAISISPEIMKSVTLIFFPLPACLNPVLYVFFNPKFKE 
DWKLLKRRVTRKHGSVSVSISSQGGCGEQDFYYDCGMYSHLQGNLTVCDCCESFLLTKPVSCKHLIKSHSCPVLTAASCQR 
PEAYWSDCGTQSAHSDYADEEDSFVSDSSDQVQACGRACFYQSRGFPLVRYAYNLQRVRD 
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nucleotide sequence of LGR5 (total 2082 nucleotides) (SEQ XD NO: 03) 

CTACATCTCCATAACAATAGAATCCACTCCCTGGGAAAGAAATGCTTTGATGGGCTCCACAGCCTAGAGACTTTAGATTTA 
AATTACAATAACCTTGATGAATTCCCCACTGCAATTAGGACACTCTCCAACTTAAAGGAACTAGGATTTCATAGCAACAAT 
ATCAGGTCGATACCTGAGAAAGCATTTGTAGGCAACCCTTCTCTTATTACAATACATTTCTATGACAATCCCATCCAATTT 
GTTGGGAGATCTGCTTTTCAACATTTACCTGAACTAAGAACACTGACTCTGAATGGTGCCTCACAAATAACTGAATTTCCT 
GATTTAACTGGAACTGCAAACCTGGAGAGTCTGACTTTAACTGGAGCACAGATCTCATCTCTTCCTCAAACCGTCTGCAAT 
CAGTTACCTAATCTCCAAGTGCTAGATCTGTCTTACAACCTATTAGAAGATTTACCCAGTTTTTCAGTCTGCCAAAAGCTT 
CAGAAAATTGACCTAAGACATAATGAAATCTACGAAATTAAAGTTGACACTTTCCAGCAGTTGCTTAGCCTCCGATCGCTG 
AATTTGGCTTGGAACAAAATTGCTATTATTCACCCCAATGCATTTTCCACTTTGCCATCCCTAATAAAGCTGGACCTATCG 
TCCAACCTCCTGTCGTCTTTTCCTATAACTGGGTTACATGGTTTAACTCACTTAAAATTAACAGGAAATCATGCCTTACAG 
AGCTGGATATCATCTGAAAACTTTCCAGAACTCAAGGTXATAGAAATGCCTTATGCTTACCAGTGCTGTGCATTTGGAGTG 
TGTGAGAATGCCTATAAGATTTCTAATCAATGGAATAAAGGTGACAACAGCAGTATGGACGACCTTCATAAGAAAGATGCT 
GGAATGTTTCAGGCTCAAGATGAACGTGACCTTGAAGATTTCCTGCTTGACTTTGAGGAAGACCTGAAAGCCCTTCATTCA 
GTGCAGTGTTCACCTTCCCCAGGCCCCTTCAAACCCTGTGAACACCTGCTTGATGGCTGGCTGATCAGAATTGGAGTGTGG 
ACCATAGCAGTTCTGGCACTTACTTGTAATGCTTTGGTGACTTCAACAGTTTTCAGATCCCCTCTGTACATTTCCCCCATT 
AT^ACTGTTAATTGGGGTCATCGCAGCAGTGAACATGCTCACGGGAGTCTCCAGTGCCGTGCTGGCTGGTGTGGATGCGTTC 
ACTTTTGGCAGCTTTGCACGACATGGTGCCTGGTGGGAGAATGGGGTTGGTTGCCATGTCATTGGTTTTTTGTCCATTTTT 
GCTTCAGAATCATCTGTTTTCCTGCTTACTCTGGCAGCCCTGGAGCGTGGGTTCTCTGTGAAATATTCTGCAAAATTTGAA 
ACGAAAGCTCCATTTTCTAGCCTGAAAGTAATCATTTTGCTCTGTGCCCTGCTGGCCTTGACCATGGCCGCAGTTCCCCTG 
CTGGGTGGCAGCAAGTATGGCGCCTCCCCTCTCTGCCTGCCTTTGCCTTTTGGGGAGCCCAGCACCATGGGCTACATGGTC 
GCTCTCATCTTGCTCAATTCCCTTTGCTTCCTCATGATGACCATTGCCTACACCAAGCTCTACTGCAATTTGGACAAGGGA 
GACCTGGAGAATATTTGGGACTGCTCTATGGTAAAACACATTGCCCTGTTGCTCTTCACCAACTGCATCCTAAACTGCCCT 
GTGGCTTTCTTGTCCTTCTCCTCTTTAATAAACCTTACATTTATCAGTCCTGAAGTAATTAAGTTTATCCTTCTGGTGGTA 
GTCCCACTTCCTGCATGTCTCAATCCCCTTCTCTACATCTTGTTCAATCCTCACTTTAAGGAGGATCTGGTGAGCCTGAGA 
AAGCAAACCTACGTCTGGACAAGATCAAAACACCCAAGCTTGATGTCAATTAACTCTGATGATGTCGAAAAACAGTCCTGT 
GACTCAACTCAAGCCTTGGTAACCTTTACCAGCTCCAGCATCACTTATGACCTGCCTCCCAGTTCCGTGCCATCACCAGCT 
TATCCAGTGACTGAGAGCTGCCATCTTTCCTCTGTGGCATTTGTCCCATGTCTCTAA 



>amino acid sequence of LGR5 (total 693 amino acids) (SEQ ID NO: 04) 

I^LHNNRIHSLGKKCFD^^ 

VGRSAFQHLPELRTLTLNGASQITEFPDLTGTANLESLTL 
QKIDLRH^IYEIKVDTFQQLLSLRSLNIA^^ 

SWISSENFPELKVIEMPYAYQCCAFGVCENAYKISNQWNKGDNSSMD^ 
VQCSPSPGPFKPCEHLI^GWLIRIGWTIAVI^TCN^ 

TFGSFARHGAWWENGVGCHVIGFLSIFASESSVFLLTLAAIjERGFSVKYSAKFETKAPFSSLKVIILLCAL 
IA3GSKYGASPLCLPLPFGEPSTMGYMVALIL3^SLC^ 
VAFLSFSSLINLTFISPEVIKFILLVWPLPACI^PL^^ 
DSTQALVTFTSSSITYDLPPSSVPSPAYPVTESCHLSSVAFVPCL 
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>Final LGR7 (LGR7-Long variant) full length sequence (2467 nt) (SEQ ID NO:05). 

GAAAGGAGGAAAGAAAAAAAGAGGAATGGAAAGAGACAGAGAAAGGAAATGGGAGTGGAAGGAGGGAGGACTGCTTT 
GTAACTGCTAAGATTGCAGACAGAAATAGCACACAACCACTGTGAGCTGTATGCGATTCAGAAACCAAGACCAAATT 
TTGCTCACTTTCATTAATCAGTTGCTCAGATAGAAGGAAATGACATCTGGTTCTGTCTTCTTCTACATCTTAATTTT 
TGGAAAATATTTTTCTCATGGGGGTGGACAGGATGTCAAGTGCTCCCTTGGCTATTTCCCCTGTGGGAACATCACAA 
AGTGCTTGCCTCAGCTCCTGCACTGTAACGGTGTGGACGACTGCGGGAATCAGGCCGATGAGGACAACTGTGGAGAC 
AACAATGGATGGTCCATGCAATTTGACAAATATTTTGCCAGTTACTACAAAATGACTTCCCAATATCCTTTTGAGGC 
AGAAACACCTGAATGTTTGGTCGGTTCTGTGCCAGTGCAATGTCTTTGCCAAGGTCTGGAGCTTGACTGTGATGAAA 
CCAATTTACGAGCTGTTCCATCGGTTTCTTCAAATGTGACTGCAATGTCACTTCAGTGGAACTTAATAAGAAAGCTT 
CCTCCTGATTGCTTCAAGAATTATCATGATCTTCAGAAGCTGTACCTGCAAAACAATAAGATTACATCCATCTCCAT 
CTATGCTTTCAGAGGACTGAATAGCCTTACTAAACTGTATCTCAGTCATAACAGAATAACCTTCCTGAAGCCGGGTG 
TTTTTGAAGATCTTCACAGACTAGAATGGCTGATAATTGAAGATAATCACCTCAGTCGAATTTCCCCACCAACATTT 
TATGGACTAAATTCTCTTATTCTCTTAGTCCTGATGAATAACGTCCTCACCCGTTTACCTGATAAACCTCTCTGTCA 
ACACATGCCAAGACTACATTGGCTGGACCTTGAAGGCAACCATATCCATAATTTAAGAAATTTGACTTTTATTTCCT 
GCAGTAATTTAACTGTTTTAGTGATGAGGAAAAACAAAATTAATCACTTAAATGAAAATACTTTTGCACCTCTCCAG 
AAACTGGATGAATTGGATTTAGGAAGTAATAAGATTGAAAATCTTCCACCGCTTATATTCAAGGACCTGAAGGAGCT 
GTCACAATTGAATCTTTCCTATAATCCAATCCAGAAAATTCAAGCAAACCAATTTGATTATCTTGTCAAACTCAAGT 
CTCTCAGCCTAGAAGGGATTGAAATTTCAAATATCCAACAAAGGATGTTTAGACCTCTTATGAATCTCTCTCACATA 
TATTTTAAGAAATTCCAGTACTGTGGGTATGCACCACATGTTCGCAGCTGTAAACCAAACACTGATGGAATTTCATC 
TCTAGAGAATCTCTTGGCAAGCATTATTCAGAGAGTATTTGTCTGGGTTGTATCTGCAGTTACCTGCTTTGGAAACA 
TTTTTGTCATTTGCATGCGACCTTATATCAGGTCTGAGAACAAGCTGTATGCCATGTCAATCATTTCTCTCTGCTGT 
GCCGACTGCTTAATGGGAATATATTTATTCGTGATCGGAGGCTTTGACCTAAAGTTTCGTGGAGAATACAATAAGCA 
TGCGCAGCTGTGGATGGAGAGTACTCATTGTCAGCTTGTAGGATCTTTGGCCATTCTGTCCACAGAAGTATCAGTTT 
TACTGTTAACATTTCTGACATTGGAAAAATACATCTGCATTGTCTATCCTTTTAGATGTGTGAGACCTGGAAAATGC 
AGAACAATTACAGTTCTGATTCTCATTTGGATTACTGGTTTTATAGTGGCTTTCATTCCATTGAGCAATAAGGAATT 
TTTCAAAAACTACTATGGCACCAATGGAGTATGCTTCCCTCTTCATTCAGAAGATACAGAAAGTATTGGAGCCCAGA 
TTTATTCAGTGGCAATTTTTCTTGGTATTAATTTGGCCGCATTTATCATCATAGTTTTTTCCTATGGAAGCATGTTT 
TATAGTGTTCATCAAAGTGCCATAACAGCAACTGAAATACGGAATCAAGTTAAAAAAGAGATGATCCTTGCCAAACG 
TTTTTTCTTTATAGTATTTACTGATGCATTATGCTGGATACCCATTTTTGTAGTGAAATTTCTTTCACTGCTTCAGG 
TAGAAATACCAGGTACCATAACCTCTTGGGTAGTGATTTTTATTCTGCCCATTAACAGTGCTTTGAACCCAATTCTC 
TATACTCTGACCACAAGACCATTTAAAGAAATGATTCATCGGTTTTGGTATAACTACAGACAAAGAAAATCTATGGA 
CAGCAAAGGTCAGAAAACATATGCTCCATCATTCATCTGGGTGGAAATGTGGCCACTGCAGGAGATGCCACCTGAGT 
TAATGAAGCCGGACCTTTTCACATACCCCTGTGAAATGTCACTGATTTCTCAATCAACGAGACTCAATTCCTATTCA 
TGA 



>Final LGR7 (LGRJ-long variant, total 757 amino acids)(SEQ ID NO:06) 

MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADEDNCGDNNGWSMQFDKYFA 

SYYKMTSQYPFEAETPECLVGSVPVQCLCQGLELDCDETNLRAVPSV^ 

LYLQNNKITSISIYAFRGLNSLTKXiYLSHNRITFLKPGVFEDLHRL 

NVLTRLPDKPLCQHMPRLHWLDLEGNHIHNLRNLTFISCSNLTVLVMRKl^ 

NLPPLIFKDLKELSQLNLSYNPIQKIQANQFDYLVTCL^ 

VRS CKPNTDG I S S LENLLAS I IQRVFVWWS AVTCFGNI FVI CMRPYIRSENKLYAMS 1 1 SLCCADCLMGIYLFVIG 
GFDLKFRGEYNKHAQLWMESTHCQLVGSIJaLSTEVS 

FIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGINLAAFIIIVFSYGSMFYSVHQSAITATEI 

RNQVTCKEMILiAKRFFFIVFTDALCWIPIFVVKFLSLLQVEIPGTITSWVVIFILP^ 

RFWYNYRQRKSMDSKGQKTYAPSFIWEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS* 



FIG. 3 



WO 99/48921 



PCT/US99/06573 



4/8 

>Final LGR7 (LGR7-Short variant) full length sequence (3584 nt)(SEQ ID NO:07) 

CTGCTTTGTAACTGCTAAGATTGCAGACAGAAATAGCACACAACCACTGTGAGCTGTATGCGATTCAGAAACCAAGA 
CCAAATTTTGCTCACTTTCATTAATCAGTTGCTCAGATAGAAGGAAATGACATCTGGTTCTGTCTTCTTCTACATCT 
TAATTTTTGGAAAATATTTTTCTCATGGGGGTGGACAGGATGTCAAGTGCTCCCTTGGCTATTTCCCCTGTGGGAAC 
ATCACAAAGTGCTTGCCTCAGCTCCTGCACTGTAACGGTGTGGACGACTGCGGGAATCAGGCCGATGAGGACAACTG 
TGTGGTGGTTTTGTGCCAGTGCATGTCTTTGCCAGGTCTGGAGCTTGACTGGATGAAACCATTTACGAGTGTTCCAT 
CGGTTTCTTCAAATGTGACTGCAATGTCACTTCAGTGGAACTTAATAAGAAAGCTTCCTCCTGATTGCTTCAAGAAT 
TATCATGATCTTCAGAAGCTGGACCTGCAAAACAATAAGATTACATCCATCTCCATCTATGCTTTCAGAGGACTGAA 
TAGCCTTACTAAACTGTATCTCAGTCATAACAGAATAACCTTCCTGAAGCCGGGTGTTTTTGAAGATCTTCACAGAC 
TAGAATGGCTGATAATTGAAGATAATCACCTCAGTCGAATTTCCCCACCAACATTTTATGGACTAAATTCTCTTATT 
CTCTTAGTCCTGATGAATAACGTCCTCACCCGTTTACCTGATAAACCTCTCTGTCAACACATGCCAAGACTACATTG 
GCTGGACCTTGAAGGCAACCATATCCATAATTTAAGAAATTTGACTTTTATTTCCTGCAGTAATTTAACTGTTTTAG 
TGATGAGGAAAAACAAAATTAATCACTTAAATGAAAATACTTTTGCACCTCTCCAGAAACTGGATGAATTGGATTTA 
GGAAGTAATAAGATTGAAAATCTTCCACCGCTTATATTCAAGGACCTGAAGGAGCTGTCACAATTGAATCTTTCCTA 
TAATCCAATCCAGAAAATTCAAGCAAACCAATTTGATTATCTTGTCAAACTCAAGTCTCTCAGCCTAGAAGGGATTG 
AAATTTCAAATATCCAACAAAGGATGTTTAGACCTCTTATGAATCTCTCTCACATATATTTTAAGAAATTCCAGTAC 
TGTGGGTATGCACCACATGTTCGCAGCTGTAAACCAAACACTGATGGAATTTCATCTCTAGAGAATCTCTTGGCAAG 
CATTATTCAGAGAGTATTTGTCTGGGTTGTATCTGCAGTTACCTGCTTTGGAAACATTTTTGTCATTTGCATGCGAC 
CTTATATCAGGTCTGAGAACAAGCTGTATGCCATGTCAATCATTTCTCTCTGCTGTGCCGACTGCTTAATGGGAATA 
TATTTATTCGTGATCGGAGGCTTTGACCTAAAGTTTCGTGGAGAATACAATAAGCATGCGCAGCTGTGGATGGAGAG 
TACTCATTGTCAGCTTGTAGGATCTTTGGCCATTCTGTCCACAGAAGTATCAGTTTTACTGTTAACATTTCTGACAT 
TGGAAAAATACATCTGCATTGTCTATCCTTTTAGATGTGTGAGACCTGGAAAATGCAGAACAATTACAGTTCTGATT 
CTCATTTGGATTACTGGTTTTATAGTGGCTTTCATTCCATTGAGCAATAAGGAATTTTTCAAAAACTACTATGGCAC 
CAATGGAGTATGCTTCCCTCTTCATTCAGAAGATACAGAAAGTATTGGAGCCCAGATTTATTCAGTGGCAATTTTTC 
TTGGTATTAATTTGGCCGCATTTATCATCATAGTTTTTTCCTATGGAAGCATGTTTTATAGTGTTCATCAAAGTGCC 
ATAACAGCAACTGAAATACGGAATCAAGTTAAAAAAGAGATGATCCTTGCCAAACGTTTTTTCTTTATAGTATTTAC 
TGATGCATTATGCTGGATACCCATTTTTGTAGTGAAATTTCTTTCACTGCTTCAGGTAGAAATACCAGGTACCATAA 
CCTCTTGGGTAGTGATTTTTATTCTGCCCATTAACAGTGCTTTGAACCCAATTCTCTATACTCTGACCACAAGACCA 
TTTAAAGAAATGATTCATCGGTTTTGGTATAACTACAGACAAAGAAAATCTATGGACAGCAAAGGTCAGAAAACATA 
TGCTCCATCATTCATCTGGGTGGAAATGTGGCCACTGCAGGAGATGCCACCTGAGTTAATGAAGCCGGACCTTTTCA 
CATACCCCTGTGAT^ATGTCACTGATTTCTCAATCAACGAGACTCAATTCCTATTCATGACTGACTCTGAAATTCATT 
TCTTCGCAGAGAATACTGTGGGGGTGCTTCATGAGGGATTTACTGGTATGAAAATGAATACCACAAAATTAATTTAT 
AATAATAGCTAAGATAAATATTTTACAAGGACATGAGGAAAAATAAAAATGACTAATGCTCTTACAAAGGGAAGTAA 
TTATATCAATAATGTATATATATTAGTAGACATTTTGCATAAGAAATTAAGAGAAATCTACTTCAGTAACATTCATT 
CATTTTTCTAACATGCATTTATTGAGTACCCACTACTATGTGCATAGCATTGCAATATAGTCCTGGAAGTAGACAGT 
GCAGAACCTTTCAATCTGTAGATAGTGTTTAATGACAAAAGACTATACAAAGTCCATCTGCAGTTCCTAGTTTAAAG 
TAGAGCTTTACCTGTCATGTGCATCAGCAAGAATCATAGGCACTTTTAAATAAAGGTTTAAAGTTTTGGAATACTCA 
GTGTATTTGCATCATAGAAAATGTCTGACTGTTTGCAAAATAATATTCTGTTTTAAGAATCCATCTTACCTCTCTTT 
AAGTTTCCATACACTTGAGAGCCAACACAACATATTTATTACTAAAAAGATGCTTTGCTAGAAACTCAAAAACAGCA 
CTTCTTTTGGCACTTCCTGCCCAGTTTTCTCTTTGCTTTAAATGAACATCATCATATGGAATTGGAATAGGAGAGTA 
TGAGTACGGCAGAGAAGTGGATCAGAAAAACTAGAATGAGGATAAACATTTACATTAGTGGAAACTCCTGAAATAAA 
TCCTTGTATTGTCAGTTAACTGATTTTCAACAAGGATGCCAAGACAAAAAGGCTTTTCAACAAACCGTGCTGTTTTA 
AGAACAGACCTAAGTGGTTTAATTCACCCACTTTAGATGGGTGAATGTTATGGTGTGTGAAATATCTCAGTAAAGCA 
GTTAAAAGGAAAAAGAGCTGGAATGCACTGATTCAGGAACTTAATTTCAGGAAGGAAAGGTCTGTATGTACACATTT 
CACTTTAAGCAGAAAATCTTTCTTCAAGAAATGACTTTACTTTCTCTTTGCACTGCCAGCACGTGAGATACTAACTT 
TTTAACTAGTTGTTCTTCTCTAGTCTCTACGTTATTAGNATTTTTTGCTTTCATAATGTGAAACCTTTAAGCAGGAG 
AAGAAAATGTTTTCAGATAGTTTCAAATACNCCAAAAATGTTTGCAACACAAAAATACTGGAAT CNAAC C ATAATG C 
CCTTATTGAATATATAGTTGTATAGNTTTGTTCTGAAAACCC 

>Final LGR7-S ORF (722 amino acids) (SEQ ID NO:08) 

MTSGS VFFYI L I FGKYFSHGGGQDVKCS LGYFPCGNITKCLPQLLHCNGVDDCGNQADEDNCVWIjCQCMS LPGLEL 

DWMKPFTSVPSVSSNVTAMSLQWNLIRKLPPDCFra 

KPGVFEDLHRLEWLIIEDNHLSRISPPTFYGLNSLILLVL 

FISCSNLTVLVMRKNKINHLNENT^ 

KLKSLSLEGIEISNIQQRMFRPLMNLSHIYFKKFQYCGYAPHWSCKPNTDGISSLENLIASIIQRVFVWWSAVTC 
FGNIFVICMRPYIRSENKLYAMSIISLCCADCI^^ 

VSVLLLTFLTLEKYICIVYPFRCWPGKCRTITVLILIWITGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESI 
GAQI YS VAI FLG INLiAAFI 1 1 VFS YGSMFYS VHQSAITATE IRNQVKKEMI31AKRFFFI VFTDALCWI P I FWKFLS 
LLQVEIPGTITSWVVIFILPINSALNPILYTLTTRPFKEMIH^ 
PPELMKPDLFTYPCEMSLISQSTRLNSYS + 
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> Alignment of LGR7-L with LGR7-S 

Query=LGR7-L 

Sbjct=LGR7-S 

Query : 1 MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADED 60 

MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADED 
Sbjct : 1 MTSGSVFFYILIFGKYFSHGGGQDVKCSLGYFPCGNITKCLPQLLHCNGVDDCGNQADED 60 

Query: 61 NCGDNNGWSMQFDKYFASYYKMTSQYPFEAETPECLVGSVPVQCLCQ GLELDCDETN 117 



Query: 118 LRAVPSVSSNVTAMSLQWNLIRKLPPDCFKNYHDLQKXYLQNNKITSISIYAFRGLNSLT 177 

+VPSVSSNVTAMSLQWNLIRKLPPDCFKNYHDLQKL LQNNKI TS I S I YAFRGLNSLT 
Sbjct: 83 FTSVPSVSSNVTAMSLQWNLIRKLPPDCFKNYHDLQKLDLQNNK^ 142 

Query: 178 KLYLSHl^ITFLKPGVFEDLHRLEWLIIEDNHLSRISPPTFYGLNSLILLVLMNNyLTRL 237 

KLYLSHNRITFLKPGVFEDLHRLEWLIIEDiraLSRISP 
Sbjct: 143 KLYLSHNRITFLKPGVFEDLHRLEWLIIEDNHLSRISPPTFYGLNSLILLVLMNTI^ 202 

Query: 238 PDKPLCQHMPRLHWLDLEGiraiHNIiRNLTFISC^ 2 97 

PDKPLCQHMPRIJIWLDLEGNHIHNLRNLTFISCSNLTVLVMRKNK^ 
Sbjct: 203 PDKPLCQHMPRLHWLDLEGNHIHNLRNLTFISCSNLTVLVMRKNK^ 262 

Query: 298 DE^LGSNKIENLPPLIFKDLKELSQLl^SYNPIQKIQANQFDYLVKLKSLSLEGIEISN 357 

DEI^LGSNKIENLPPLIFKDLKELSQI^SYNPIQKIQANQFDYLVKLKSIiSLEGIEISN 
Sbjct: 263 DEIJ^LGSNKIENLPPLIFKDLK^LSQI^SYNPIQKIQANQFDYLVKLKSLSLEGIEISN 322 

Query: 358 IQQRMFRPIJYEl^SHIYFKKFQYCGYAPHWSCK 417 

IQQRMFRPLMNLSHI YFKKFQYCGYAPHVRS CKPNTDG I S SLENLLAS I IQRVFVWWSA 
Sbjct: 323 IQQRMFRPLMNLSHI YFKKFQYCGYAPHVRS CKPNTDG I S SLENLLAS I IQRVFVWWSA 382 

Query: 418 VTCFGNI FVI CMRPYIRSENKLYAMS I ISLCCADCLMG I YLFVIGGFDLKFRGEYNKHAQ 477 

VTCFGNI FVI CMRPYIRSENKLYAMS I ISLCCADCLMG I YLFVIGGFDLKFRGEYNKHAQ 
Sbjct: 383 VTCFGNI FVICMRPYIRSENKLYAMS I ISLCCADCLMG I YLFVIGGFDLKFRGEYNKHAQ 442 

Query: 478 LWMESTHCQLVGSLAILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVLILIWI 537 

LWMESTHCQLVGSIA.ILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVLILIWI 
Sbjct: 443 LWMESTHCQLVGSLAILSTEVSVLLLTFLTLEKYICIVYPFRCVRPGKCRTITVLILIWI 502 

Query: 538 TGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGINLAAFIIIVF 597 

TGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAIFLGINLAAFIIIVF 
Sbjct: 503 TGFIVAFIPLSNKEFFKNYYGTNGVCFPLHSEDTESIGAQIYSVAI FLGINLAAFIIIVF 562 

Query: 598 SYGSMFYSVHQSAITATEIRNQVKKEMILAKRFFFIVFTDALCWIPIFVVKFLSLLQVEI 657 

SYGSMFYSVHQSAITATEIRNQVKKEMILAKRFFFIVFTDALCWIPIFVVKFLSLLQVEI 
Sbjct: 563 SYGSMFYSVHQSAITATEIRNQVKKEMIIAKRFFFIVFTDALCWIPIFVVKFLSLLQVEI 622 

Query: 658 PGTITSWWIFILPINSALNPILYTLTTRPFKEMIHRFWYNYRQRKSMDSKGQKTYAPSF 717 

PGTITSWVVIFILPINSALNPILYTLTTRPFKEMIHRFWYNYRQRKSMDSKGQKTYAPSF 
Sbjct: 623 PGTITSWVVIFILPINSALNPILYTLTTRPFKEMIHRFWYNYRQRKSMDSKGQKTYAPSF 682 

Query: 718 IWVEMWPLQEMPPELMKPDLFTYPCEMSLISQSTRLNSYS 757 

I WVEMWPLQEMP PELMKPDLFTYPCEMS LI SQSTRLNS YS 
Sbjct: 683 I WVEMWPLQEMP PELMKPDLFTYPCEMSL I SQSTRLNSYS 722 



Sbjct : 



61 



NC 
NC 



V V C C GLELD + 

VWLCQCMSLPGLELDWMKP- 82 
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FIG. 6 

Signal peptide 

LGR4 MPGPLGLLCFLALGLLGSAGPSGA 
LGR 5 MDTS RLGVLLS LP VLLQLATG 
LHR MKQRFSALQLLKLLLLLQPPLPRA 
FSHR MALLLVS LLAFLSLGSG 
TSHR MRPADLLQLVLLLDLPRDLGG 

N- flank cysteine-rich sequence 

LGR4 APPL AA-P S DGDR RVD SGKGLTAVPEGLSAFTQA 

LGR 5 GSSPRSGVLLRG P-TH H EPDGRMLLRVD SDLGLSELPSNLSVFTSY 
LHR LREAL P-EP N VPDG--ALR-- PGPTAGLTR 

FSHR HHRI H SNRVFL QESKVTEIPSDLPRNAIE 

TSHR MG SSPP E HQEED- - FRVT KDIQRIPSLPPSTQT 

Leucine-rich repeats 

4 ► « ► M 

LGR4 DISMNNITQLPED KSFPFLEELQLAGN -- SL HPKALSG KE KVLTLQ -- Q 

LGR5 DLSMNNISQLLPNPLPSLHFLEELRLAGNA-- TY PKGA TG YS KVLMLQ -- Q 

LHR SLAYLPVKVIPSQ RGLNEVIKIEISQI S- ER EANA DN LN SEILIQ TK - 

FSHR RFVLTKLRVIQKG SGFGDLEKIEISQN V- EV EADV SN PK HEIRIEKAN - 

TSHR KLIETHLRTIPSH SNLPNISRIYVSI - VT QQLESHS YN SKVTHIEIR TR - 

► ^ 

LGR4 RTV- SE IHG SA QS RLDA H- TSV EDS - - FEGLVQLRH WLD S-L- EV VR 

LGR 5 RHV- TE LQN RS QS RLDA H- SYV P- SC- FSGLHSLRH WLD A-L- E VQ 

LHR RYIE -G FIN PG KY SIC- TG RKF DVTKVFSSESNFI - EIC LHI - T GN 

FSHR LYIN -E FQN PN QY LIS - TG KHL DVHK- IHSLQKVL- DIQ INIH - ERN 

TSHR TYID -D LKE PL KF GIF- TGLKMF DLTK- VYSTDIFFI EIT PYM- S VN 

► M ► < 

LGR 4 PLSN P-TLQA T AL NISSIPDF T LSS W H HN K-IKSLSQHC D LDN-LE 
LGR 5 A RS S-ALQAMT AL KIHHIPDY G LSSWW H HN R-IHSLGKKC D LHS-LE 
LHR A QGMNNESVT K YG GFEEVQSH - GTT TS E KE VHLEKMHNGA R A-TGPK 

FSHR S VG SFESVI W NK GIQEIHNC - GTQ DE N SD NNLEELPNDV H A-SGPV 
TSHR A QG CNETLT K YN GFTSVQGY - GTK DAVY NK KYLTVIDKDA G VYSGPS 

► <« ► M 

LGR 4 T LNYNYLDEF Q-AIKA PS KELGFHSNS ISVI D-GA GGNPL RTIH - DNPLS 
LGR 5 T LNYNNLDEF T-AIRT SN KELGFHSNNIRSI E-KA VGNPS ITIHF- DNPIQ 

LHR T ISSTKLQAL SYGLESIQR I-ATS-SYSLKKL SRET V-N-- LEAT T 

FSHR I ISRTRIHSL SYGLEN KK R-ARSTYN-LKKL TLEKLVA ME AS T 

TSHR L VSQTSVTAL SKGLEH KE I - ARNTWT - LKKL LSLS LH TRAD S 

► «4 

LGR4 FVGNSAFHNLSDLHCLVIRGASLVQWFPNLTGTVHLESLTLTGTKISSIPDDLCQNQKML 
LGR 5 FVGRSAFQHLPELRTLTLNGASQITEFPDLTGTANLESLTLTGAQISSLPQTVCNQLPNL 

LHR 

FSHR 

TSHR 
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► < ► <« ► <« 

LGR4 RTLDLSYNNIRDLPSFNGCRALEEISLQRNQISLIKENTFQGLTSLRILDLSRNLIREIH 

LGR5 QVLDLSYNLLEDLPSFSVCQKLQKIDLRHNEIYEIKVDTFQQLLSLRSLNLAWNKIAIIH 

LHR 

FSHR 

TSHR 

^ 

LGR4 S GAFAKLGT I TNLDVS FNELTS FPTEGLNGLNQLK 
LGR5 PNAFS TLP S L I KLDLS S NLL S S FP I TGLHGLTHLK 

LHR 

FSHR 

TSHR 

C- flank cysteine-rich sequence 

LGR4 LVGNFKLKDALAARDFANLRSLSV YAYQ WGCDS LCKLNTEDNS PQEHS VTKEKGA 

LGR5 LTGNHALQSLISSENFPELKVIEM YAYQ GVCENAYKI SNQWNKGDNSSMDDLHKK 

LHR - -SH RNLPTKEQNFSHS ISENFSKQCESTVR 

FSHR - - SH ANWRRQ I S E LHP I CNKS I LRQE VD YMT 

TSHR - - SH KNQKKI RG I LE S LMCNE S SMQS LRQRK 

LGR4 TDAANVTS TAENE HS 

LGR5 DAGMFQAQDERDL DF 

LHR KVSNKTLYS SMLA SE 

FSHR QTRGQRS SLAEDN SS 

TSHR SVNALNS PLHQEY ENLGDS IVGYKEKSKFQDTHNNAHYYVFFEEQEDE I IGFGQELKNP 

LGR4 QIIIH T STGA K YLLGSWMI 

LGR5 LLDFEEDLKALHSVQ S SPGP K HLLDGWLI 

LHR LSGWDYE YGFCLPKTPR - A EPDA N DIMGYDFL 

FSHR YS RG FDMT YTE FD YDLCNE WDVT S KPDA N DIMGYNIL 

TSHR QEETLQ AFDSH YD YT I CGDS EDMV T KSDE N DIMGYKFL 

Transmembrane 

TM 1 TM 2 

LGR4 LTV F FLV LLF LL ILTVFA CSS PASKLFIGLI SVSNLLM IYTGILTFL AVSW 

LGR5 IGV T AV LTC AL TSTVFR PLYISPIKL I GVI AAVNMLT VSSAVL G AF F 

LHR VLI L NI IMG MT LFVLLT RYK TVPRF MCNLS FADFCM LYLLLI S SQ K 

FSHR VLI F SI ITG II LVILTT QYK TVPRF MCNLAFADLC I IYLLLI S IH K 

TSHR IW FVSL LLG VF LLILLT HYK NVPRF MCNLAFADFCM MYLLLI S LY H 

TM 3 



LGR4 GRFAEFG WE S KV SLA S SA FL LAAV SVFAKDLMKHGKSSH QF 

LGR5 GSFARHGAW EN V HVI LSI S FL LAA GFSVKYSAKFET APFSSL 

LHR GQYYNHA D Q S ST FT L YT VIT WHTITYAIHLDQ LR HA 

FSHR SQYHNYA D Q A DA FT L YT AIT WHTITHAMQLDC VQ HA 

TSHR SEYYNHA D Q P NT FT L YT VIT WYAITFAMRLDR IR HA 
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TM 4 












TM 


5 




LGR4 


QVAALLALLGAAVAGCF 


FHGGQ 


SASPL 


FPTGETPSLGFTVTLVL 


SL 


LLMA 


LGR5 


KVI ILLCALLALTM 


AV 


L G 


K 


GAS PL 


LPFGEPSTMG 


MVALIL 


SLC 


LMMT 


LHR 


ILIMLGGWLFSSLI 


ML 


V V 


N 


MKVSI 


F MDVETTLSQV 


ILTILI 


W 


FIIC 


FSHR 


AS VMVMGWI FAFAA 


LF 


IF I 


S 


MKVSI 


MDIDSPLSQL 


VMSLLV 


VL 


WIC 


TSHR 


CAIMVGGWVCCFLL 


LL 


V I 


S 


AKVSI 


MDTETPLALA 


IVFVLT 


IV 


VIVC 



TM 6 



LGR4 


II 


T 


L 


CNL - EKEDLS ENSQS S V I 


HV 


W 


NC I FFC 


VA 


FSFAPLITAIS 


SPEI 


LGR5 


IA 


T 


L 


CNL - DKGDLEN I W 


CSMV 


HI 


L L 


NCILNC 


VA 


LSF 


SLINLTF 


SPEV 


LHR 


AC 


I 


I 


FAVRNP E LMATNK 


TKIA 


KM 


I 


DFTCMA 


IS 


FA I 


AAFKVPL 


TVTN 


FSHR 


GC 


IHI 


LTVRNPNIVSSSS 


TRIA 


RM 


M 


DFLCMA 


IS 


FAI 


ASLKVPL 


TVSK 


TSHR 


CCHV 


I 


I TVRNPQYNPGDK 


TKIA 


RM 


V 


DFICMA 


IS 


YAL 


AILNKPL 


TVSN 



TM 7 



LGR4 


M 


SVTLI F 


LPA 


L 


V 


VF 


N 


LGR5 


I 


FI LVW 


LPA 


L 


L 


IL 


N 


LHR 


S 


VL VL Y 


INS 


A 


F 


AI 


T 


FSHR 


A 


IL VL H 


INS 


A 


F 


AI 


T 


TSHR 


S 


IL VL Y 


LNS 


A 


F 


AI 


T 



C- terminal tail 



LGR4 


PK 


KE 


WKL 


KRRVTRKHGS VS VS I S SQGGCGEQDFYYDCGMYSHLQGNLTVCDCCES FL 


LGR5 


PH 


KE 


LVS 


RKQTYWTRSKHPSLMSINSDDVEKQSCDSTQALVTFTSSSITYDLPPSS 


LHR 


KT 


QR 


FFL 


LSKFGCCKRRAELYRRKDFSAYTSNCKNGFTGSNKPSQSTLKLSTLHCQG 


FSHR 


KN 


RR 


FFI 


LS KCGCYEMQAQ I YRTETS S TVHNTHPRNGHCS S APR VTNGS TY I LVPLS 


TSHR 


KA 


QR 


VFI 


LS KFG I CKRQAQAYRGQRVPPKNS TD I QVQKVTHDMRQGLHNMEDVYEL I 



LGR4 LTKPVSCKHLIKSHSCPVLTAASCQRPEAYWSDCGTQSAHSDYADEEDSFVSDSSDQVQA 

LGR 5 VPS PAYPVTES CHL S S VAFVPCL 

LHR TALLDKTRYTEC 

FSHR HLAQN 

T S HR ENS HLTPKKQGQ I S EE YMQTVL 

LGR4 CGRAC F YQS RG FPLVR YAYNLQR VRD 



FIG. 6 (CONT) 



